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SEQUENCE LISTING 



<120> Method and Apparatus for Extracting and Evaluating Mutually 
Similar Portions in One-Dimensional Sequences in Molecules and/or T 
hree-Dimensional Structures of Molecules 

<130> 522 . 1921D2 

<140> 09/910,071 
<141> 2001-07-23 

<150> US 08/014,867 
<151> 1993-02-08 

<160> 20 

<170> Patentin version 3.1 

<210> 1 

<211> 37 

<212> PRT 

<213> human 

<400> 1 

Gly Asp Val Glu Lys Gly Lys Lys lie Phe lie Met Lys Cys Ser Gin 
15 10 15 



His Thr Val Glu Gly Gly Lys His Lys Thr Gly Pro Asn Leu His Gly 
20 25 30 



Leu Phe Gly Arg Lys 
35 



<210> 2 

<211> 39 

<212> PRT 

<213> bacterium 




<110> Tomikawa, Mayumi 
Aikawa, Seiichi 
Matsuzawa, Fumiko 
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<400> 2 

Glu Gly Asp Asp Ala Ala Ala Gly Glu Lys Val Ser Lys Lys Cys Leu 
15 10 15 



Ala Cys His Thr Phe Asp Gin Gly Gly Ala Asn Lys Val Gly Pro Asn 
20 25 30 



Pro Asn Leu Phe Gly Val Phe 





35 


<210> 


3 


<211> 


51 


<212> 


PRT 


<213> 


rat 


<400> 


3 



Met Ser Leu Ala lie Leu Arg Val lie Arg Leu Val Arg Val Phe Arg 
15 10 15 



lie Phe Lys Leu Ser Arg His Ser Lys Gly Leu Gin lie Leu Gly Arg 
20 25 30 



Thr Leu Lys Ala Ser Met Arg Glu Leu Gly Leu Leu lie Phe Phe lie 
35 40 45 



Gly Val Val 
50 



<210> 4 

<211> 142 

<212> PRT 

<213> unknown 

<220> 

<223> Figure 23A 

<400> 4 
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Thr Glu Glu Gin lie Ala Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp 
15 10 15 



Lys Asp Gly Asp Gly Thr He Thr Thr Lys Glu Leu Gly Thr Val Met 
20 25 30 



Arg Ser Leu Gly Gin Asn Pro Thr Glu Ala Glu Leu Gin Asp Met He 
35 40 45 



Asn Glu Val Asp Ala Asp Gly Asn Gly Thr He Asp Phe Pro Glu Phe 
50 55 60 



Leu Thr Met Met Ala Arg Lys Met Lys Asp Thr Ser Glu Glu Glu He 
65 70 75 80 



Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser 

85 90 95 



Ala Ala Glu Leu Arg His Val Met Thr Asn Leu Gly Glu Lys Leu Thr 
100 105 110 



Asp Glu Glu Val Asp Glu Met He Arg Glu Ala Asn He Asp Gly Asp 
115 120 125 



Gly Gin Val Asn Tyr Glu Glu Phe Val Gin Met Met Thr Ala 
130 135 140 



<210> 5 

<211> 159 

<212> PRT 

<213> unknown 

<220> 

<223> Figure 23B 

<400> 5 
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Ala Met Asp Gin Gin Ala Glu Ala Arg Ala Phe Leu Ser Glu Glu Met 
15 10 15 



He Ala Glu Phe Lys Ala Ala Phe Asp Met Phe Asp Ala Asp Gly Gly 
20 25 30 



Gly Asp He Ser Thr Lys Glu Leu Gly Thr Val Met Arg Met Leu Gly 
35 40 45 



Gin Asn Pro Thr Lys Glu Glu Leu Asp Ala He He Glu Glu Val Asp 
50 55 60 



Glu Asp Gly Ser Gly Thr He Asp Phe Glu Glu Phe Leu Val Met Val 
65 70 75 80 



Arg Gin Met Lys Glu Asp Ala Lys Gly Lys Ser Glu Glu Glu Leu Ala 

85 90 95 



Asp Cys Phe Arg He Phe Asp Lys Asn Ala Asp Gly Phe He Asp He 
100 105 110 



Glu Glu Leu Gly Glu He Leu Arg Ala Thr Gly Glu His Val Thr Glu 
115 120 125 



Glu Asp He Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly 
130 135 140 



Arg He Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly Val Gin 
145 150 155 



<210> 6 

<211> 28 

<212> PRT 

<213> unknown 
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<220> 

<223> calmodulin probe site 81-108, Figure 25, target 
<400> 6 

Leu Ala Asp Cys Phe Arg lie Phe Asp Lys Asn Ala Asp Gly Phe lie 
15 10 15 



Asp lie Glu Glu Leu Gly Glu lie Leu Arg Ala Thr 
20 25 



<210> 7 

<211> 28 

<212> PRT 

<213> unknown 

<220> 

<223> calmodulin probe site 81-108, probe, Figure 25 
<400> 7 

lie Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr lie 
15 10 15 



Ser Ala Ala Glu Leu Arg His Val Met Thr Asn Leu 
20 25 



<210> 8 

<211> 27 

<212> PRT 

<213> unknown 

<220> 

<223> Calmodulin probe site 117-143, Figure 26, target 
<400> 8 

lie Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly Arg lie 
15 10 15 



Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly 
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20 25 



<210> 9 

<211> 27 

<212> PRT 

<213> unkown 

<400> 9 



Val Asp Glu Met lie Arg Glu Ala Asn lie Asp Gly Asp Gly Gin Val 
15 10 15 



Asn Tyr Glu Glu Phe Val Gin Met Met Thr Ala 
20 25 



<210> 10 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> Coordinates of C-alpha corresponding to the aminon acid resi 
due N 

OS. 7 to 14 in elongation factor of protein which is a bindi 

ng si 

te for phosphoric acid of GTP 
<400> 10 



Gly His Val Asp His Gly Lys Thr 



1 5 



<210> 11 

<211> 8 

<212> PRT 

<213> unknown 



<220> 

<223> adenylate kinase - target 
<400> 11 
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Gly Ala Pro Gly Ser Gly Lys Gly 

1 5 



<210> 12 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> ras protein - target 

<400> 12 

Gly Ala Gly Gly Val Gly Lys Ser 
1 5 



<210> 13 

<211> 8 

<212> PRT 

<213> unknown 

<220> 

<223> adenylate kinase (3ADK) 

<400> 13 

Gly Gly Pro Gly Ser Gly Lys Gly 
1 5 



<210> 14 

<211> 223 

<212> PRT 

<213> unknown 

<220> 

<223> amino acid sequence of trypsin 
<400> 14 

lie Val Gly Gly Tyr Thr Cys Cys Ala Asn Thr Val Pro Tyr Gin Val 
15 10 15 
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Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu He Asn Ser 

20 25 30 



Gin Trp Val Val Ser Ala Ala His Cys Tyr Lys Ser Gly He Gin Val 
35 40 45 



Arg Leu Gly Glu Asp Asn He Asn Val Val Glu Gly Asn Glu Gin Phe 
50 55 60 



He Ser Ala Ser Lys Ser He Val His Pro Ser Tyr Asn Ser Asn Thr 
65 70 75 80 



Leu Asn Asn Asp He Met Leu He Lys Leu Lys Ser Ala Ala Ser Leu 

85 90 95 



Asn Ser Arg Val Ala Ser He Ser Leu Pro Thr Ser Cys Ala Ser Ala 
100 105 110 



Gly Thr Gin Cys Leu He Ser Gly Trp Gly Asn Thr Lys Ser Ser Gly 
115 120 125 



Thr Ser Tyr Pro Asp Val Leu Lys Cys Leu Lys Ala Pro He Leu Ser 
130 135 140 



Asp Ser Ser Cys Lys Ser Ala Tyr Pro Gly Gin He Thr Ser Asn Met 
145 150 155 160 



Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser Cys Gin Gly Asp 

165 170 175 



Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu Gin Gly He Val Ser 
180 185 190 



Trp Gly Ser Gly Cys Ala Gin Lys Asn Lys Pro Gly Val Tyr Thr Lys 
195 200 205 
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Val Cys Asn Tyr Val Ser Trp He Lys Gin Thr He Ala Ser Asn 
210 215 220 



<210> 15 

<211> 239 

<212> PRT 

<213> unknown 

<220> 

<223> amino acid sequence of Elastase 
<400> 15 

Val Val Gly Gly Thr Glu Ala Gin Arg Asn Ser Trp Pro Ser Gin He 
15 10 15 



Ser Leu Gin Tyr Arg Ser Gly Ser Ser Trp Ala His Thr Cys Gly Gly 
20 25 30 



Thr Leu He Arg Gin Asn Trp Val Met Thr Ala Ala His Cys Val Asp 
35 40 45 



Arg Glu Leu Thr Phe Arg Val Val Val Gly Glu His Asn Leu Asn Gin 
50 55 60 



Asn Asn Gly Thr Glu Gin Tyr Val Gly Val Gin Lys He Val Val Pro 
65 70 75 80 



Tyr Trp Asn Thr Asp Asp Val Ala Ala Gly Tyr Asp He Ala Leu Leu 

85 90 95 



Arg Leu Ala Gin Ser Val Thr Leu Asn Ser Tyr Val Gin Leu Gly Val 
100 105 110 



Leu Pro Arg Ala Gly Thr He Leu Ala Asn Ser Pro Cys Tyr He Thr 
115 120 125 
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Thr Gly Trp Gly 
130 



Gin Gin Ala Tyr 
145 



Ser Tyr Trp Gly 



Asp Gly Val Arg 
180 



Cys Leu Val Asn 
195 



Leu Thr Arg Thr 
135 



Leu Pro Thr Val 
150 



Ser Thr Val Lys 
165 



Ser Gly Cys Gin 



Gly Gin Tyr Ala 
200 



Asn Gly Gin Leu 
140 



Asp Tyr Ala He 
155 



Asn Ser Met Val 
170 



Gly Asp Ser Gly 
185 



Val His Gly Val 



Ala Gin Thr Leu 



Cys Ser Ser Ser 
160 



Cys Ala Gly Gly 
175 



Gly Pro Leu His 
190 



Thr Ser Phe Val 
205 



ser Arg Leu Gly Cys Asn Val Thr Arg Lys Pro Thr Val Phe Thr Arg 
210 215 220 

val ser Ala Tyr He Ser Trp He Asn Asn Val He Ala Ser Asn 
225 230 235 



<210> 16 

<211> 6 

<212> PRT 

<213> unknown 

<223> histidine active sites (41-46) of elasta 

<400> 16 

Met Thr Ala Ala His Cys 
1 5 



<210> 17 
<211> 6 
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<212> PRT 

<213> unknown 

<220> 

<223> histidine active sites (36-41) of trypsin 

<400> 17 

Val Ser Ala Ala His Cys 
1 5 



<210> 18 

<211> 5 

<212> PRT 

<213> unknown 

<220> 

<223> serine active sites of elastase 

<400> 18 

Gly Asp Ser Gly Gly 
1 5 



<210> 19 

<211> 5 

<212> PRT 

<213> unknown 

<220> 

<223> serine active sites (175-179) of trypsin 

<400> 19 

Gly Asp Ser Gly Gly 
1 5 



<210> 20 

<211> 120 

<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Figure 46 - examples of outputs protein A and protein B 
<400> 20 

Ala Thr Tyr Arg Leu Lys Asn Gly Asp Asp Pro Leu Ser Asp His Lys 
15 10 15 



Leu Ser Asp Phe Gly Phe Ser Gly Lys lie Leu Thr Arg Tyr Asn Gin 
20 25 30 



Ala Val Gly Asp Asp Pro Leu Asp Gly Ala Arg Ser Asp Phe Gly Phe 
35 40 45 



Ser Gly Lys lie Pro Leu Thr Arg Tyr Asn Val Gin Asn Asp Glu Ser 
50 55 50 



Gly Phe Arg Thr Lys His Gly Leu Asp Ala Ser Asn Ala Ala Thr Tyr 
65 70 75 80 



Arg Leu Lys Asn Asp Asp Pro Leu Asp Gly Ala Arg Glu Lys Leu Ser 

85 90 95 



Asp Phe Gly Ala His Phe Ser Gly Lys lie Phe Thr Arg Tyr Asn Gin 
100 105 110 



Ala Val Gin Asn Asp Glu Ser Lys 
115 120 
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